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In this paper we unify the Markov theory of a variety of different types of graphs used in 
graphical Markov models by introducing the class of loop less mixed graphs, and show that all 
independence models induced by m-separation on such graphs are compositional graphoids. We 
focus in particular on the subclass of ribbonless graphs which as special cases include undirected 
graphs, bidirected graphs, and directed acyclic graphs, as well as ancestral graphs and summary 
graphs. We define maximality of such graphs as well as a pairwise and a global Markov property. 
We prove that the global and pairwise Markov properties of a maximal ribbonless graph are 
equivalent for any independence model that is a compositional graphoid. 
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1. Introduction 

1.1. Introduction and motivation 

Graphical Markov models have become widely used in recent years. The models use 
graphs to represent conditional independence relations for systems of random variables, 
with nodes of the graph corresponding to random variables and edges representing de- 
pendencies. Several classes of graphs with various independence interpretations have 
been described in the literature. These range from undirected graphs with simple sepa- 
ration for derivation of independencies [19] to various forms of mixed graphs [18; 25; 29], 
including chain graphs with several different separation criteria used for derivation of 
independencies [10; 5; 17; 2; 8]. 

In spite of the significant differences among these graphs, their structural similarities 
motivate an attempt to unify them. For this purpose, we introduce the class of loop- 
less mixed graphs and let them entail independence models using the same separation 
criterion, m-separation. This unification covers many graphical independence models in 
the literature with some independence models for chain graphs forming a notable excep- 
tion; see Sec. 4 for further details. We show that any independence model generated by 
m-separation in a loopless mixed graph is a compositional graphoid. 

A common motivation for defining MC-graphs [18], summary graphs [29], and ancestral 
graphs [25], is to represent independence relations implied by marginalisation over and 
conditioning on sets of variables satisfying the Markov property of a directed acyclic 
graph (DAG). The focus of our study is on a subclass of loopless mixed graphs which we 
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shall term ribbonless. The class of ribbonless graphs is sufficiently rich to serve the same 
purpose, as it can be considered a modification of MC graphs, and it contains summary 
graphs and ancestral graphs as special cases. 

For ribbonless graphs, we define global and pairwise Markov properties, the latter 
being associated with interpreting missing edges in the graph as representing conditional 
independencies. We prove as our main result that a compositional graphoid independence 
model over a maximal ribbonless graph satisfies the global Markov property if and only 
if it satisfies the pairwise Markov property. 

1.2. Some early results on Markov properties 

The concepts of pairwise and global Markov properties for undirected graphs were in- 
troduced in [13] in the context of random fields and shown to be equivalent for positive 
densities. Alternative proofs were later given independently by several authors, for ex- 
ample [12; 3]; see also [4]. An abstract variant of this theorem was proven in [21] for 
independence models satisfying graphoid axioms as these are satisfied by probabilistic 
distributions with positive densities; see also [28] and [11]. Independence models for undi- 
rected graphs were discussed comprehensively in Chapter 3 of [19]. 



A global Markov property that uses the m-separation criterion and a pairwise Markov 
property were defined in [25] for maximal ancestral graphs without considering conditions 
under which they are equivalent. We use a generalisation of these Markov properties for 
maximal ribbonless graphs, which contains maximal ancestral graphs as a subclass, and 
prove their equivalence for compositional graphoids. This is the main result of the present 
paper. It has been mentioned as a conjecture in [14]. 

1.3. Structure of the paper 

In the next section we introduce the basic concepts of graph theory, general and proba- 
bilistic independence models, and compositional graphoids. 

In Sec. 3 we introduce the class of loopless mixed graphs and additional graph theo- 
retical definitions. We also associate the m-separation criterion to this class, and prove 
for any loopless mixed graph that the independence model induced by m-separation is a 
compositional graphoid. 

In Sec. 4 we introduce the class of ribbonless graphs and the concept of anterior graphs. 
We describe the relations between these as well as subclasses of loopless mixed graphs 
that have been discussed in the literature. 

In Sec. 5 we introduce the concept of maximality by demanding that any additional 
edge will change the independence model. It is shown that ribbonless graphs are not 
necessarily maximal, and conditions for maximality are given. 

In Sec. 6 we define a pairwise and a global Markov property for independence mod- 
els for ribbonless graphs, and prove our main result: that pairwise and global Markov 
properties are equivalent for compositional graphoid independence models over maximal 
ribbonless graphs. 
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2. Basic definitions and concepts 

In this section we introduce basic definitions and notation for independence models, 
graphs, and compositional graphoids. 

2.1. Basic graph theoretical definitions 

A graph G is a triple consisting of a node set or vertex set V, an edge set E, and a 
relation that with each edge associates two nodes (not necessarily distinct), called its 
endpoints. When nodes i and j are the endpoints of an edge, they are adjacent and we 
write i ^ j. We say the edge is between its two endpoints. We usually refer to a graph as 
an ordered pair G = {V,E). Graphs Gi = {Vi,Ei) and G2 = (V2,i52) are called equal if 
{Vi,Ei) — (V2,i?2)- In this case we write G\ = G2. 

Notice that our graphs are labeled, i.e. every node is considered as a different object. 
Hence, for example, graph i j k is not equal to j i k. 

A loop is an edge with the same endpoints. Multiple edges are edges with the same 
pair of endpoints. A simple graph has neither loops nor multiple edges. 

A subgraph of a graph Gi is graph G2 such that V^(G2) C V{Gi) and E{G2) ^ E{Gi) 
and the assignment of endpoints to edges in G2 is the same as in Gi. An induced subgraph 
by nodes A C y is a subgraph that contains all and only nodes in A and all edges between 
two nodes in A. A subgraph induced by edges _B C is a subgraph that contains all and 
only edges in B and all nodes that are endpoints of edges in B. 

A walk is a list (wq, ei, wi, . . . , e^, w^) of nodes and edges such that for 1 < i < fc, the 
edge Ci has endpoints Vi-i and Vi. A path is a walk with no repeated node or edge. If 
the graph is simple then the path can be uniquely determined by an ordered sequence 
of node sets. Throughout this paper we use node sequences to describe paths even in 
graphs with multiple edges, as it usually is apparent from the context which of multiple 
edges belong to the path. We say a path is between the first and the last nodes of the 
list in G. We call the first and the last nodes endpoints of the path and all other nodes 
inner nodes. 

If TTi = (i = ia,ii,...,in,h) and 7r2 = {h, jm, jm-i, ■ ■ ■ Jo = j) are paths, their 
combination 7ri2 = 7ri07r2 is the path 7ri2 = (*,..., ip-i, k,jq-i, . . . ,j), where k = ip = jq 
is the first node of tti which is on both paths. If fc = /i then 7ri2 is simply the concatenation 
of the two paths. In general, the concatenation of two paths will be a walk and not a 
path as the paths may intersect in more than one point. 

A subpath of a path tt is a path that can be considered a subgraph of tt with the 
ordering associated with tt. A cycle in a graph G is a simple subgraph whose nodes can 
be placed around a circle so that two nodes are adjacent if they appear consecutively 
along the circle. 
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2.2. Independence models 

An independence model J over a set y is a set of triples (X, Y \ Z) (called indepen- 
dence statements), where X, Y, and Z are disjoint subsets of V and Z can be empty, 
and (0, Y \ Z) and {X, \ Z) always being included in J. The independence statement 
{X, Y\Z) is interpreted as "X is independent of Y given Z" . 

An independence model over a set ^ is a semi-graphoid if for disjoint subsets A, 
B, C, and D of V, it satisfies the four following properties: 

1. {A,B\C) ej if and only if (B,A\C) ej (symmetry); 

2. if {A,BU D\C) e J then (A, B | C) G J and {A,D\C) £ J (decomposition); 

3. if {A, BUD \C) eJ then (A, B | CUD) G J and (A,!) | CUB) G J (weafc wmon); 

4. {A,B\CUD) e J and (A, D | C) G J if and only if (A, B U £> | C) G J (conirac- 

A semi-graphoid for which the reverse implication of the weak union property holds is 
said to be a graphoid, that is 

5. if {A,B\ CUD) G J and {A,D\ CUB) G J then (A.BUD \C) e J (intersection). 

Furthermore, a graphoid or semi-graphoid for which the reverse implication of the de- 
composition property holds is said to be compositional, that is 

6. if {A,B\C) eJ and {A,D\C) eJ then {A,BUD\C) e J (composition). 

2.3. Probabilistic conditional independence models 

The most common independence models are induced by probability distributions. Con- 
sider a set V and a collection of random variables (Xa)a£V with state spaces Xa, a € V 
and joint distribution P. We let Xa = (Xy)y^A etc. for each subset A of V. For disjoint 
subsets A, B, and C of F we use the short notation AALB\C to denote that Xa is 
conditionally independent of Xb given Xq [7; 19], i.e. that for C Xa 

P(Xa en\XB=XB,Xc = xc) = P(Xa en\Xc^ xc). 

We can now induce an independence model J(P) by letting 

(A, B\C) e J(P) if and only if A ALB \C w.r.t. P. 

We say that an independence model is probabilistic if there is a distribution P such 
that J' = J(P)- We then also say that P is faithful to J . 

Probabilistic independence models are always semi-graphoids [21], whereas the con- 
verse is not necessarily true; see [28]. If P has strictly positive density, the induced 
independence model is also graphoid; see e.g. Proposition 3.1 in [19]. If the distribution 
P is a regular multivariate Gaussian distribution, J^(P) is a compositional graphoid. This 
follows from the fact that for such a distribution 

AALB\C ^AuBuc = for all a G A, /? G B, 
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where k'^[JS^JC is the a/3 entry in the concentration matrix of the distribution of X^ubuc- 
Probabihstic independence models with positive densities are not in general compo- 
sitional graphoids; this only holds for special types of multivariate distributions such as 
the Gaussian mentioned above and, say, the symmetric binary distributions used in [31]. 

3. Independence models for mixed graphs 

3.1. Mixed graphs 

A mixed graph is a graph containing three types of edges denoted by arrows, arcs (bi- 
directed arrows), and lines (full lines). Notice that, in contrast to places in the literature, 
we allow multiple edges of the same type. A loopless mixed graph (LMG) is a mixed 
graph that does not contain any loops (a loop may be line, arrow, or arc). For an arrow 
j — >~i, we say that the arrow is from j to i. We also call j a parent of i, i a child of j 
and we use the notation pa(i) for the set of all parents of i in the graph. In the cases of 
i — or i < > j we say that there is an arrowhead at j or pointing to j. 

A path {i = iQ,ii, . . . ,in = j) is direction-preserving from i to j if all ikik+i edges are 
arrows pointing from ik to ik+i- If there is a direction-preserving path from j to i then 
j is an ancestor of i and i is a descendant of j. We denote the set of ancestors of i by 
an(i). 

A V- configuration is a path with three nodes and two edges. Notice that originally 
[16] and in most texts, the cndpoints of a V-configuration arc by definition not adjacent 
whereas we do not use this restriction. 

In a mixed graph the inner node of three V-configurations i — 7~t^ — j, z < > — j, 
and i < > t < > j is a collider (or a collider node) and the inner node of any other V- 
configuration is a non-collider (or a non-collider node) in the V-configuration or more 
generally on a path of which the V-configuration is a subpath. We shall also say that the 
V-configuration with inner collider or non-collidcr node is a collider or non- collider. We 
may speak of a collider or non-collidcr without mentioning the V-configuration or the 
path when this is apparent from the context. Notice that a node may be a collider on 
one path and a non-collider on another path. 

Two paths TTi and 7r2 (including V-configurations or edges) between i and j are called 
endpoint-identical if there is an arrowhead pointing to i in tti if and only if there is 
an arrowhead pointing to i in 1:2 and similarly for j. For example, the paths i — 7~j, 

i k < > j, and i — >~k^ — I < > j are all endpoint-identical as they have an arrowhead 

pointing to j but no arrowhead pointing to i on the paths. 

3.2. Anterior graphs and sets 

The anterior graph of a loopless mixed graph G, denoted by G*, is the graph obtained 
from G by repeatedly removing arrowheads pointing to nodes that are the endpoints of 
a line, i.e. by obtaining o and -< — o from — ^ o and < > o 
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respectively. A path (i = io, ii, . . . , in = j) from i to j in G* is an anterior path if it has 

the form i ii im — >- im+i — ' — ^ j- 

We shall say that i is anterior of j if there is an anterior path from i to j in G* . 
We use the notation ant(z) for the set of all anteriors of i. Notice that, since ancestral 
graphs have no arrowheads pointing to lines, we have G = G* for an ancestral graph. 
Thus our definition of anterior extends the notion of anterior used in [25] for ancestral 
graphs. However, it is different from and inconsistent with the definition of anteriors in 
[10] and [1]. 

For example, in the graph G in Fig. 1(a), ant(i) ~ {l,h,j,p} and ant(p) = {l,h,j}. 
This can be seen by looking at the anterior paths {p,j, h, I, i) from p to i and (/, h,j,p) 
from / to p (as well as from p to in Fig. 1(b). 

k p k p 

, (I I (I 

i — > / ^ h — ^ j i / h j 

(a) (b) 
Figure 1. (a) A mixed grapli G. (b) The anterior graph G* of G. 

We first show that transitivity holds for anteriors. 

Lemma 1. For any loopless mixed graph it holds that if i ^ Bint{j) and j £ ant(fc) then 
i e ant(fc). 

Proof. If i S ant(j) and j 6 ant(fc), G* has anterior paths tti from i to j and ^2 from j 
to k. As no arrowhead meets a line in G* their combination tti o tt2 is an anterior path 
from i to j in G*. □ 

Here we also introduce a Lemma that is used in several proofs of this paper. 

Lemma 2. Let G be a loopless mixed graph. If i E ant(j) \ an(j) then either i or a 
descendant of i is the endpoint of a line in G. 

Proof. The proof uses induction on the number of arrowheads removed from G to obtain 
G*. For the base, if G = G* it follows immediately from the definition of an anterior 
path that i must be the endpoint of a line or we would have i £ an(j). 

Next, suppose that G* is obtained from G by removing n + 1 arrowheads and let G 
be obtained from G by removing a single arrowhead pointing to a line from G. Then G* 
is also the anterior graph of G, but with only n arrowheads needing removal. Thus if 
i £ ant(7) in G, it is also anterior to j in G. Consider now two cases: 
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Case I. Assume i is an ancestor of j in G. Since i is not an ancestor of j in G, G 
must have been obtained by turning an arc into an arrow. Say this arrowhead points to 
h. Then h is an endpoint of a Une and it is a descendant of i in G. 

Case II. If i is not an ancestor of j in G, the inductive hypothesis yields that i is 
either adjacent to a hne ih in G or or has a descendant h in G which is the endpoint of a 
hne in G. Let h be the node adjacent to a hne in G. If the arrowhead removed is not on 
the direction-preserving path tt from i to h the conclusion obviously follows. Else, there 
must be node fc on tt which is adjacent to a line in G and can be used instead of h. □ 

3.3. The m-separation criterion 

Here we define a separation criterion for LMGs. We use this criterion to induce indepen- 
dencies on LMGs and its subclasses defined in Section 3. 

Wc first define an m-connecting path: Let C be a subset of the node set of an LMG. 
A path is m-connecting given C if all its collider nodes are in C U an(C) and all its 
non-collider nodes are outside C. For two disjoint subsets of the node set A and B, we 
say that C m-separates A and B if there is no m-connecting path between A and B given 
C. In this case we use the notation AJ-„iB \ C. Notice that the m-separation criterion 
induces an independence model Jm{G) on G by A ± ^B \ G <J=> {A, i? | G) G Jm{G). 

We note that m-separation is unaffected if we replace multiple edges of the same type 
with a single edge of that type. The m-separation criterion for LMGs is the same as 
the separation criterion defined in [25]. It is an extension of the d-separation criterion 
introduced in [21]. Clearly, m-separation is also an extension of simple separation in an 
undirected graph, as then all edges are lines. 

For example, in graph G in Fig. 2 it holds that h e an(Z) and, thus, {i,h,j) is an 
m-connecting path given I. Therefore, {i,j\l) ^ Jm{G). We now have the following 

k — > / 




I ^h^ j 

Figure 2. A loopless mixed graph G for which (i, j j I) J'm(G). 

theorem. A similar result for the induced independence model for MC graphs was given 
in Proposition 2.10 of [18]. 

Theorem 1. For any loopless mixed graph G, the independence model Jrn{G) is a 
compositional graphoid. 

Proof. For G = {N, F) and disjoint subsets A, B, G, and D of N, we prove that ± „ 
satisfies the six compositional graphoid axioms: 
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1) Symmetry: If A _L „iB \ C then B _L [ C: If there is no m-connecting path between 
A and B given C then there is no m-connecting path between B and A given C. 

2) Decomposition: li AJ-rn{B U D)\C then AJ-„iD \ C: If there is no m-connecting 
path between A and B U D given C then there is no m-connccting path between A and 
DC (BUD) given C. 

3) Weak union: If A± rn{B U D) | C then A L mB \ {C \J D): From (2) we know that 
A^^mD\C and j4_L,„B|C. Suppose, for contradiction, that there exist m-connecting 
paths between A and B given C U D. Consider a shortest path of this type and caU it 
TT. If there is no inner collider node on tt then there is an m-connccting path between 
A and B given C, a contradiction. On tt all collider nodes are in (C U Z?) U an(C U D). 
If all collider nodes are in C U an(C) then there is an m-connecting path between A 
and B given C, again a contradiction. Hence consider the closest collider node i G (D U 
an(£')) \ (C U an(C)) to A on tt. Now since the nodes between A and i are not in i? U 13, 
there is an m-connecting path between A and i given C . li i C D then this is obviously 
a contradiction, otherwise there is an m-connecting path between A and a node k G D, 
such that i € an(fc), given C, a contradiction again. Therefore, there is no m-connecting 
path between A and B given C U D. 

4) Contraction: If _L | C and A _L | (B U C) then A _L „(B U D) | C: Suppose, 
for contradiction, that there exists an m-connecting path between A and BUD given 
C. Consider a shortest path of this type and call it tt. The path tt is either between A 
and B or between A and D. The path tt being between A and B contradicts A _L | C. 
Therefore, tt is between A and £). In addition, since all inner collider nodes on tt are in 
C U an(C) and because AL^D | (i? U C), an inner non-coUider node should be in B. 
This contradicts the fact that tt is a shortest m-connecting path between A and BUD 
given C. 

5) Intersection: If A ± „5 |(C U L>) and A _L „L> | (C U B) then A±rniB U D)\C: 
Suppose, for contradiction, that there exists an m-connecting path between A and BUD 
given C. Consider a shortest path of this type and call it tt. The path tt is either between 
A and B or between A and D. Because of symmetry between B and D in the formulation 
it is enough to suppose that tt is between A and B. Since all inner collider nodes on tt 
are in C U an(C) and because A±mB \ [C U D), an inner non-collider node should be 
in D. This contradicts the fact that tt is a shortest m-connecting path between A and 
B\JD given C. 

6) Composition: li A L mB \ C and A L mD \ C then A ± rn{B \J D)\C: Suppose, for 
contradiction, that there exist m-connecting paths between A and B U D given C. Con- 
sider a path of this type and call it tt. Path tt is either between A and B or between A 
and D. Because of symmetry between B and D in the formula it is enough to suppose 
that TT is between A and B. But this contradicts AJ-^B \ C. □ 

Theorem 1 implies that we can focus on establishing conditional independence for 
pairs of nodes, formulated in the corollary below. 

Corollary 1. For a loopless mixed graph G and disjoint subsets of its node set A, B, 
and C , it holds that A _L ,„i? | C if and only if i-L mj \ C for every nodes i G A and j G B. 
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4. Subclasses of loopless mixed graphs 

LMGs and their associated independence models induced by m-separation unify a variety 
of previously discussed graphical independence models. 

4.1. Chain graphs 

Important exceptions include certain independence models for chain graphs. Chain graphs 
themselves are LMGs, but at least four different Markov properties for chain graphs have 
been discussed in the literature. Drton [8] has classified them into (i) the LWF or block 
concentration Markov property, (ii) the AMP or concentration regression Markov prop- 
erty, (iii) a Markov property that is dual to the AMP Markov property, (iv) and the 
multivariate regression Markov property. When the chain components consist entirely of 
arcs, the multivariate regression property is identical to the one induced by m-separation. 
However, the independence model induced by m-separation in a chain graph is typically 
different from any of the other chain graph interpretations; see also [24; 22] and [20]. 

4.2. Ribbonless graphs 

The class of MC graphs, defined in [18], contains line loops and uses a difi^erent separation 
criterion for inducing an independence model. However, a small modification of an MC 
graph generates a so-called ribbonless graph [26] which is loopless and induces the same 
independence model as the MC graph, but by m-scparation. The remaining part of this 
paper deals with such graphs. 

A ribbon is a collider V-configuration {h,i,j) such that 

1. there is no cndpoint-identical edge between h and j i.e. there is no hj-aic in the 
case of h < > i < > j; there is no hj-line in the case of h — — j; and there is no 
arrow from h to j in the case of h — M < > j; 

2. i or a descendant of i is the endpoint of a line or on a direction-preserving cycle. 

If i or a descendant of i is the endpoint of a line then we say the ribbon is straight and 
if they are on a direction-preserving cycle we say the ribbon is cyclic. A ribbonless graph 
(RG) is an LMG that has no ribbons as induced subgraphs. Fig. 3 illustrates a straight 
ribbon (/i, and the simplest cyclic ribbon. 

Fig. 4(a) illustrates a graph containing a straight ribbon {h,i,j) and Fig. 4(b) illus- 
trates a ribbonless graph. Notice that {h,i,j) is not a ribbon here since there is a line 
between h and j and this is an cndpoint-identical edge. We proceed to establish that 
ribbonless graphs yield identical independence models to their anterior graphs and need 
the following lemma. 
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h. 



h 



k I 

(a) 



(b) 



Figure 3. (a) A straight ribbon {h,i,j) with ne(j) — 0. (b) The simplest cyclic ribbon {h,i,j). 

h > i -6 7 h 7 

A 



p — k ^ 1 p ^ k 

(a) (b) 

Figure 4. (a) A graph that is not ribbonless. (b) A ribbonless graph. 



Lemma 3. Let G be a ribbonless graph. If there is a collider V- configuration {i,j,k) in 
G that is non-collider in G* , then G has an ik-edge that is endpoint-identical to {i^j, k) . 

Proof. Suppose that (G = Go, Gi, . . . , G„ = G*) is a sequence of graphs, where each 
graph has been generated by removing one arrowhead pointing to a full line from the 
previous graph starting from G. 

Consider the first intermediate graph Gp+i where fc) turns into a non-collider 
V-configuration. We prove by reverse induction that, for each < 9 < p, {i,j,k) is a 
straight ribbon unless there is an endpoint-identical ifc-edge to {i,j,k). 

In Gp, the node i is obviously the endpoint of a line and the result holds. Thus we 
assume that the result holds for Gq. In Gq-i, it is easy to observe that if the line that 
makes the ribbon is an arrow pointing to another line or if an arrow on the direction- 
preserving cycle pointing to a line is an arc then i or a descendant of i is still the endpoint 
of a line. Therefore, the result holds in Gq-i. Therefore, by reverse induction, this result 
holds in G, and since G is ribbonless, in G there is an endpoint-identical ik-edge to 
{ij,k). □ 

For the graph G in Fig. 3 (a), the anterior graph G* is the graph where all edges 
become undirected. Clearly there is no endpoint-identical edge hj and the conclusion of 
Lemma 3 does not hold. This illustrates the role of a graph being ribbonless. 

Proposition 1. For a ribbonless graph G, it holds that J^m{G) = Jm{G*), i.e. G and 
G* are Markov equivalent. 

Proof. It is enough to prove that there is an m-connecting path between i and j given 
G in G if and only if there is an m-connecting path between i and j given G in G* . 
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Suppose that there is an m-connecting path between i and j given C in G. All non- 
colliders on the path in G are preserved in G* . In addition, by Lemma 3, a collider 
V-configuration (i,j,k) becomes non-collider if there is an endpoint-identical ifc-edge 
to {i,j,k). In this case the «fc-edge can be used instead of {i,j,k) to establish an m- 
connecting path in G*. 

Conversely, suppose that there is an m-connecting path between i and j given C in 
G* . Collider V-configurations are collider V-configurations in G, and if a non-collider 
V-configuration {i,j,k) has been collider in G then, by Lemma 3, one can again use 
the ifc-edge instead of {i,j,k). Thus the only thing that remains to be proven is that a 
direction-preserving path pointing to a member of C in G remains direction-preserving 
in G*. 

In this case, by the same argument as in Lemma 3, if for the collider V-configuration 
fc), where j £ an(G), the arrowhead of an arrow on the direction-preserving path 
in G is taken away then {i,j,k) is a ribbon unless there is an endpoint-identical ik-edge 
to {i,j,k). Hence again we can use the «A;-edge instead of {i,j,k) for establishing an 
m-connecting path. □ 

Thus the absence of ribbons ensures that the Markov property is unchanged by forming 
the anterior graph G*. Again, as the anterior graph G* of the graph G in Fig. 3 (a) is 
the graph will all edges becoming undirected, we have h _L mj in G but not h _L mj in G* , 
illustrating that absence of ribbons is essential for the Markov equivalence of G and G* . 

Independence models induced by m-separation in a ribbonless graph can be induced by 
marginalisation over and conditioning on a DAG-independence model [26]. This implies 
that independence models corresponding to RGs are probabilistic, i.e. any RG has a 
faithful probability distribution. 

4.3. Other subclasses of loopless mixed graphs 

Other subclasses of LMGs that use m-separation and have been discussed in the literature 
are summary graphs [29], ancestral graphs [25], acyclic directed mixed graphs [27; 23], 
undirected or concentration graphs [6; 19], bidirected or covariance graphs [5; 15; 30; 9], 
and the class of directed acyclic graphs [16; 21; 11]. 

The use of some of the above graphs are motivated by representing independence 
models obtained by marginalisation over and conditioning on subsets of the node set of 
a DAG. For those graphs, arcs indicate marginalisation and lines indicate conditioning. 

The diagram in Fig. 5 illustrates the hierarchy of subclasses of LMGs and their associ- 
ated independence models generated by m-separation. For example, it can be seen from 
the diagram that bidirected graphs are also ancestral graphs, since they form a subclass 
of multivariate regression chain graphs, which again form a subclass of ancestral graphs. 
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Looploss mixed graphs 



Y 

Ribbonloss graphs 



Y 

Summary graphs 




Undirected graphs Directed acyclic graphs Bidirected graphs 

Figure 5. The hierarchy of subclasses of LMGs. 

5. Maximal ribbonless graphs 

Among the independence models over the node set V oi a. graph G, those that are of 
interest to us conform with G, meaning that i j in G imphes \ C) ^ J for any 
G C y \ Henceforth, we assume that independence models J conform with G, 

unless otherwise stated. 

For example, the independence model J = {(«, ^ | j), (i, fc | 0)} conforms with the graph 
G in Fig. 6, whereas J ~ {{ij \ {hi \ 0)} does not conform with G because of the 
independence statement (i, j | 0). 



i J k 

Figure 6. The independence model J — {{i,l \ j) , {i, k\ 0}} conforms with G whereas J = 
{iij I 0>} does not. 

A ribbonless graph G is called maximal if by adding any edge to G the independence 
model induced by m-separation changes. Note that in [29] a graph that is maximal is 
called an independence graph. 
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The independence models on RGs induced by m-separation conform with the graphs; 
hence for maximal graphs, adding an edge to the graph makes the independence model 
smaller. Therefore, we have the lemma below. 

Lemma 4. A graph G = {V, E) is maximal if and only if for every pair of non-adjacent 
nodes i and j of V , there exists a subset C o/ V \ {i, j} such that i _L mj | C. 

Proof. The result follows directly from the definition of maximality. □ 

RGs are not maximal in general. To see this consider the RG in Fig. 7. There is no 
G such that i-Lmj \ G. This is because if fc € C, the path i — < > j is m-connccting 
given G, and if A: ^ C, i — 7^k — 7~j is m-connecting given G. 

i — ^k^^j 

Figure 7. A non-maximal RG. 

To characterise maximal RGs we need the following notion: A path {j, qi,q2, . . . ,qp,i) 
is a primitive inducing path between i and j if and only if for every n, 1 < ?^ < p, 

(i) Qn is a collider on the path; and 

(ii) Qn e an({i} U {j}). 

Note in particular we consider any edge between i and j to be a primitive inducing path. 
Also, in Fig. 7, {i^k,j) is a primitive inducing path. For ancestral graphs, a primitive 
inducing path is an inducing path as defined in [25] with il/ = C = 0. 

Next we need the following lemmas. These also establish a pairwise Markov property 
for maximal RGs. 

Lemma 5. A non- collider node k on a path tt between i and j in a ribbonless graph 
G is either in ant(z) U ant(j) or an anterior of a collider node h on tt. Moreover, the 
relevant subpath of tt between k and i, j or h is an anterior path in G* . 

Proof. Let fc = be a non-collider node on a path tt = (i = iq, «i, . . . , in — ])■ Then 
from at least one side (say from im-i) there is no arrowhead on tt pointing to k. By 
moving towards i on the path as long as ip, 1 < p < m — 1, is non-collider on the path, 
we obtain that k G ant(ip_i). This implies that if no ip is a collider then k £ ant(i) and 
hence the lemma follows. □ 

Lemma 6. For nodes i and j in an RG that are not connected by any primitive inducing 
paths (and hence i 9^ j), it holds that z _Lmj | (ant(z) U ant(j)) \ {«, j}- 

Proof. Suppose, for contradiction, there is an m-connecting path between i and j given 
(ant(i) U ant(j)) \ {i,j} and denote a shortest such path by tt. If there is a non-collider 
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node k on tt then, by Lemma 5, k is either in ant(i) U ant(j) or it is an anterior of a 
coUider node on tt. But since tt is m-connecting given (ant(z) U ant(j)) \ colhder 
nodes are in ant(i) U ant(j) themselves. Hence k G ant(z) U ant(j), which contradicts the 
fact that TT is m-connecting. Therefore, all inner nodes of tt must be colliders. 

Now we know that all inner nodes of tt are in ant(i) U ant(j) and i 7^ j. If, for a 
collider V-configuration {r,l,s) on tt, I G (ant(i) U ant(j)) \ (an(i) U an(j)) then, by 
Lemma 2 and since the graph is ribbonless, there is an endpoint-identical rs-edgc to the 
V-configuration, which contradicts tt being shortest. Therefore, / S an(z) U an(j), which 
implies that tt is primitive inducing, again a contradiction. Therefore, there is no m- 
connecting path between i and j given (ant(i)Uant(7))\{i, j}, and hence i _L mj \ (ant(i)U 
ant{j)) \{i,j}. □ 

Next, in Theorem 2 we give a necessary and sufficient condition for an RG to be 
maximal. The analogous result for ancestral graphs was proved in Theorem 4.2 of [25]. 

Theorem 2. A ribbonless graph G is maximal if and only if G does not contain any 
primitive inducing paths between nan- adjacent nodes. 

Proof. If there exists a pair of non-adjacent nodes between i and j connected by a 
primitive inducing path tt then regardless of whether the inner nodes are in CUan(C) or 
not, an m-connccting path between i and j can be found. Therefore, G is non-maximal. 
More precisely, suppose that tt = {i = io,ii,...,i„ = j). We trace an m-connecting 
path between i and j by the following method. We start from i. We consider the closest 
node fc = i(j to j on tt such that it is in an(i) and the nodes on the direction-preserving 
path are not in C. We proceed to k through the direction-preserving path. Hence all ip, 
q < p < n ~ I are either in an(C) or in an(j) \ an(C). If ip is in C or if it is not in C 
or an(j) \ an(C) then we proceed to ip+i. If it is not in C but in an(j) \ an(C) then we 
proceed to j through the nodes of the direction-preserving path. 

By letting C = (ant(i) U ant(j)) \ {i, j} for every non-adjacent nodes i and j, the other 
direction follows from Lemmas 4 and 6. □ 

For other special types of graphs that arc subclasses of RGs, the condition for maxi- 
mality of RGs may get further simplified. Among the subclasses of RGs that have been 
mentioned in this paper, summary graphs, ancestral graphs, and acyclic directed mixed 
graphs are not necessarily maximal, while all others are maximal. This can be seen by 
checking whether primitive inducing paths are permissible in each subclass. 

A Markov equivalent maximal graph can be generated from a non-maximal graph 
by adding endpoint-identical edges to a primitive inducing path between a pair of non- 
adjacent nodes. We shall not give details of this here in this paper. The following lemma 
establishes that anterior graphs of maximal graphs are themselves maximal. 

Lemma 7. Let G be a ribbonless graph and G* its anterior graph. Then if G is maximal, 
so is G* . 
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Proof. If, for contradiction, G* is not maximal, then Theorem 2 imphes that there is a 
primitive inducing path in G* between non-adjacent nodes i and j. Consider a shortest of 
such primitive inducing paths between i and j and denote it by tt. We know that all inner 
nodes of tt are colliders in G* . This trivially implies that all inner nodes of tt are colliders 
in G too. In addition, each inner node k on tt is in an({i,j}) in G*. In G, fc £ an({i, j}) 
unless an arrow on the direction-preserving path from fc to i or j is an arc turning into 
an arrow in G*. In this case k is an ancestor of a node that is the endpoint of a line. 
Hence the V-configuration {h, k, I) on tt is a ribbon unless there is an endpoint-identical 
hl-edge to the V-configuration, which contradicts the fact that tt is a shortest. Therefore, 
TT is a primitive inducing path in G, a contradiction. Hence G* is maximal. □ 

6. Markov properties for ribbonless graphs 

In this section we give a precise definition of the global and pairwise Markov properties 
for an independence model J' defined over the node set of a ribbonless graph. Further 
we show that these two Markov properties are equivalent for a maximal ribbonless graph 
if Jj is also a compositional graphoid. This result is a direct generalisation of the similar 
result of [21] for undirected graphs and graphoids. 

6.1. Global and pairwise Markov properties 

For a ribbonless graph G = {V,E), an independence model J' defined over V satisfies 
the global Markov property w.r.t. G if it holds for A, B, and G disjoint subsets of V that 

A±r,B\C =^ {A,B\C) e J. 

Similarly, an independence model defined over V satisfies the pairwise Markov 
property w.r.t. G if it holds for any nodes i and j that 

I'^j =^ (i,j|(ant(i)UantO-))\{i,j}) G J. 

For example, for the graph in Fig. 8, the pairwise Markov property would imply that 
{i,m\{k,l,h}) as ant(i) ~ {k,l,h,m} and ant(m) = {l,h}. It would also imply that 
{l,p\{h,m}). 

Clearly, the independence model J^m{G) induced by m-separation always satisfies the 
global Markov property w.r.t. G. By Lemma 4, Lemma 6, and Theorem 2, Jm{G) satisfies 
the pairwise Markov property if and only if G is maximal. 

6.2. Equivalence of pairwise and global Markov properties 

Before establishing the main result of this section, we need two lemmas. 
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/ h 



i m — 

Figure 8. The pairwise Markov property for this RG implies, for example, {i, m \ {k, I, h}). The 
global Markov property would for example imply {{i, k},j \ I). 

Lemma 8. Let J be a compositional graphoid over a set V and M and C be disjoint 
subsets ofV. It then holds that the marginal independence model a{J', A/) = {(A, B \C) : 
(A, B \C) ^ J and (AUi?UC)nM = 0}, which is defined over Y\M , is a compositional 
graphoid. 

Proof. All the six compositional graphoid properties for a{J', M) follow trivially from 
the facts that for A, B, and C such that {AL>BUC)nM = 0, {A, B\C} e a{J, M) if 
and only if (A, B\C) ^ J, and J satisfies the six properties. □ 

The following lemma gives sufficient conditions for the combination of two m-connecting 
paths in anterior graphs to be m-eonnccting. 

Lemma 9. Let G* he the anterior graph of a rihhonless graph G and suppose that there 
are paths tti ~ i^i = zq, ^i, ■ • ■ , 

h) between i and h and tt2 = {h,jm,jm-i,- --Jo = j) 
between h and j which are m-connecting given G. The combination 7ri2 = tti o7r2 is then 
an m-connecting path between i and j given G in each of the following mutually exclusive 
situations: 

al) (i„, h,j„i) is a collider and h £ G U an(C); 

a2) in = jm with an arrowhead pointing to h on the inh-edge and h £ G U an(C); 

bl) {in,h,jm) is a non-collider and h ^ G; 

b2) i'fi — jVn with no arrowhead pointing to h on the i^h-edge. 

Proof. Let 7ri2 = tti o 7r2 = (i, . . . ip-i, fc, . . . ,j) be the combination of tti and 7r2. 
If fc = /i and either al) or bl) holds then the conclusion is obvious. The cases a2) or b2) 
are only relevant when k ^ h. 

Next consider the situation where k ^ h. Since tti and tt2 are m-connecting, for 7ri2 
to be m-connecting we only need to check the V-configuration /c, jq_i). We have 

to deal with two cases: 

Case 1: (ip-i , fc, jg_i) is a non-collider. In this case there is no arrowhead pointing to 
k from at least one of ip-i or jq-i. This means that (ip-i, k, ip+i) on tti or fc, jg+i) 

on 7r2 is a non-collider, and since tti and 7r2 were both m-connecting we have k ^ G. Hence 
7ri2 is m-connecting. 
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Case 2: fc, is a collider. We need to consider the following two subcases: 

Case 2.1. If {ip-i, fc, is a collider and any of {ip-i, k, ip+i) or {jq-i, fe, jq+i) is also 
a collider then k E C U an(C) and 7ri2 is 771-connecting. 

Case 2.2. If fc, is a collider but (ip_i, fc, ip+i) and fc, j^+i) arc both 

non-colliders then by Lemma 5, the subpath of tti from fc to a collider node h or to h is 
an anterior path and similarly for 7r2, I2, and h. However, since G* is an anterior graph 
and there are arrowheads pointing to k, these anterior paths must be direction-preserving 
and thus k G an(Zi) Uan(/i) and k 6 an(/2)Uan(/i). Now we have the two following further 
subcases: 

Case 2.2.1: One of the suhpaths of 1:1,112 from k to li,l2 is direction-preserving. Be- 
cause TTi and 1T2 are m-connecting we must have li or ^2 in C U an(C). Thus k € an(C) 
and 7ri2 is m-connecting. 

Case 2.2.2: Both subpaths of tti and 1T2 from k to h are direction-preserving. Then 
(imh,jm) is collider or i„ = jVn with an arrowhead pointing to h on the inh-edge and 
bl) and b2) are impossible. If al) or a2) holds 7ri2 is m-connecting since then h S 
CUan(C). □ 

We arc now ready to establish the main result of this paper. 

Theorem 3. Let G be a maximal rihbonless graph. If an independence model J over the 
node set of G is a compositional graphoid, then J satisfies the pairwise Markov property 
w.r.t. G if and only if it satisfies the global Markov property w.r.t. G. 

Proof. (<^) If J" is a compositional graphoid and satisfies the global Markov property 
it follows from Theorem 2 and Lemma 6 that it satisfies the pairwise Markov property. 

Now suppose that J satisfies the pairwise Markov property and compositional 
graphoid axioms. For subsets A, B, and C of the node set of G, we should prove that 
AJ-mB I C implies {A,B | C) G J". By composition, it is sufficient to show this when A 
and B are singletons, i.e. that i _L mj \ C implies {i,j \C) € J^. 

Further we observe that it is sufficient to establish the result in the case when G = G* it 
itself an anterior graph. For Proposition 1 gives that ALmB\C in G implies AJ-mB\C 
in G*. In addition, by Lemma 7, G* is a maximal graph. Moreover, G and G* have 
the same anterior sets, and therefore the same pairwise Markov property. Thus in the 
following we assume that G = G* is an anterior graph. 

We prove the result in two main parts. In part I we prove the result for the case that 
C C ant(i) U ant(j). In part II we use the result of part I to establish the general case. 

Part I. Suppose that G C ant(i) U ant(j). We use induction on the number of nodes of 
the graph. The induction base for a graph with two nodes is trivial. Thus suppose that 
the result holds for all anterior graphs with fewer than n nodes and assume that G* has 
n nodes. 
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Let D = {i} U {]} U ant(i) U ant(j) and M ^ V \ D, where V is the node set of the 
graph. Fh'st in case I.l we suppose that M ^ 0, and then in case 1.2 we suppose that 
M = 0. 

Case I.l. Consider G*[D] to be the subgraph induced by D. Consider the marginal 
independence model a( J, M) = {{A,B\C) : {A,B\C) € J and {A U B U C) H M = 
0} defined over D. By Lemma 8, a{J,M) is a compositional graphoid. In addition, 
it satisfies the pairwise Markov property: This is because two non-adjacent nodes li 
and I2 in G*[D] are non-adjacent in G* and by the pairwise Markov property for 
{li,l2 I ante* d) U antcih) \ {^1,^2}) £ J, where ante* is the anterior set in G* . We 
know that ant^. (/i) U antG.(/2) ^ D and hence antG'-«(^i) U anta-ih) Ci M = 0. In 
addition, for a node I in G*[D], antG*(0 = antG>-[_D](0- Therefore, {hjh \ s^^^c [D]{h) U 
aiitG'[D]il2)\{h,l2}) & a{J,M). 

We also know that i _L ,„ j | C in G* implies i _L | C in G* \D] since there is no m- 
connecting path between i and j given G in G* and by removing nodes and edges from 
G* no new m-connecting paths are generated. Therefore, by the induction hypothesis 
(i, j I G) e a(J, M). This implies that (i, j | C) e J. 

Case /.^. Now suppose that M = and thus the node set of G* \s D — {1} U 
{j} U ant(i) U ant(j). We prove the result by reverse induction on |G|: For the base, 
G — V \ ~ ant(i) U ant(j) \ {i, j} and the result follows trivially from the pairwise 
Markov property. 

For the inductive step, consider a node h ^ G. We want to show that h is not m- 
connected to both i and j: Suppose, for contradiction, there are m-connecting paths 
h) and tt2 = {h,jm,jm-i, • ■ • ,.7o = j) given G. If bl) or b2) of Lemma 9 
hold then i and j are m-connected given G which contradicts i 1. ,nj \ G. So we need only 
consider the cases where h,jrn) is collider or i„ = jm with an arrowhead pointing to 
h on the inh-edge. However, we know that h £ ant(i) or /i £ ant(j). Because of symmetry 
between i and j suppose that h S ant(i). Since G* is an anterior graph and there is an 
arrowhead pointing to h we have h G an(i). Hence there is a direction-preserving path 
TT from h to i. If no node on tt is in C then bl) or b2) of Lemma 9 implies that the 
combination of tt and 7r2 is an m-connecting path between i and again a contradiction. 
If there is a node on tt that is in G then h £ an(C) and again, by al) and a2) of Lemma 9, 
i and j are m-connected given C, again a contradiction. 

We conclude that, given C, h is not m-connected to both i and j. By symmetry 
suppose that i _L mh \ G. 

We also have that i^-mj \ G. Since Jm{G*) is a compositional graphoid (Theorem 1) 
the composition property gives that i ^-m{j^h} \ G. By weak union for _Lm we obtain 
i _L mj I {h} U G and i ± mh \ {j} U G. By the induction hypothesis we obtain \ {h} U 
G) € J and {i, h \ {j} U G) E J. By intersection we get {i, {j, h}\G) £ J . By decompo- 
sition we finally obtain (i, j \ G) E J . 

Part II. We now prove the result in the general case by induction on |C|. The base, i.e. 
the case that |C| = 0, follows from part I. To prove the inductive step we can assume 
that G ^ ant(z) U ant(j), since otherwise part I implies the result. 
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We first show that if C ^ ant(i) U ant(j) then there is a node I in C sucli that 
iJ-mj\C \ {I}: Let first G C \ (ant(i) U ant(j)) be arbitrary. If there is an I" £ 
C \ (ant(i) U ant(j)) so that /' e ant(/") and /" ^ ant(/') then replace I' by and 
repeat this process until it terminates, the latter being ensured by transitivity of ant 
(Lemma 1) and the finiteness of C. Thus we eventually obtain an I so that if I £ ant(Z) 
for I € C \ (ant(z) U ant(j)) then we also have / G ant(Z). 

Suppose, for contradiction, that there is a shortest m-connccting path tt between i 
and j given C \ {I}. If / is not on tt or is a collider on tt then tt is also m-connccting given 
C. Therefore, / is a non-collider on tt. This, together with I ^ ant(i) U ant(j), by using 
Lemma 5, implies that I is an anterior of a collider node p on tt. Since tt is m-connecting, 
p G C U an(C). Thus there is an ? S C so that p = I ot p Q an(/). Transitivity of anterior 
sets and the fact that I ^ (ant(i) U ant(j)) now imply that I G C \ (ant(z) U ant(j)). 
The construction of I implies I € ant(Z) which again implies that I G an(i) and I G an{l) 
and thus the collider V-configuration containing p is a cyclic ribbon unless its endpoints 
are adjacent with an cndpoint-identical edge, which implies that tt is not a shortest 
m-connecting path, a contradiction. 

We now have that either i J- ^1 \ C \ {l} oi j -L ml \ C \ {1} since otherwise, by Lemma 9 
there is an m-connecting path between i and j given C \ {/} in the case that Hs a non- 
collider or given C in the case that / is a collider node. Because of symmetry suppose that 
i-Lml\ C\{1}. By the induction hypothesis we have {i,j \ C\{1}) G J and {i, 1 1 C\{1}) € 
J . By the composition property we get (i, {j, V\\C \ {/}) G J . The weak union property 
implies (i, j \ C) G J . □ 

For the special case of probabilistic independence models we get the following corollary: 

Corollary 2. Let G he a maximal ribbonless graph. A probabilistic independence model 
that satisfies the intersection and composition axioms satisfies the pairwise Markov prop- 
erty w.r.t. G if and only if it satisfies the global Markov property w.r.t. G. 

6.3. Necessity of compositional graphoid axioms 

Theorem 3 states that, for equivalence of pairwise and global Markov properties, the 
six compositional graphoid axioms are sufficient. In fact, in general, for the mentioned 
equivalence, all six axioms are also necessary. The graphs in Fig. 9 show that the in- 
tersection and composition properties are necessary for the equivalence of pairwise and 
global Markov properties. 

For Gi ~ {V\,Ei), if Ji defined over Vi satisfies the pairwise Markov property then 
(i, k I {j, r\), {i, 1 1 {j, k}), and {k, 1 1 {i, j}) are in J'l. It can be seen that none of the com- 
positional semi-graphoid axioms can be used to imply (i, {k, 1} \ j) e Ji. The intersection 
property is the only axiom that implies the result. 

For G2 = {V2,E2), if k72 defined over V2 satisfies the pairwise Markov property then 
{i, fc I 0), (i, / 1 0), and (fc, I \ 0) arc in J'2. It can be seen that none of the graphoid axioms 
can be used to imply {i, {fc, 1} \ 0) G J2- The composition property is the only axiom that 
implies the result. 
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For G3 = (VijjE's), if defined over V3 satisfies the pairwise Markov property then 
(i, k I 0), {i, 1 1 {j, k}), and (fc, 1 1 {«, j}) are in jTs- It can be seen that none of the compo- 
sitional semi-graphoid axioms can be used to imply {l,{i,k} \j) £ jTa. The intersection 
property is the only axiom that implies the result. See also e.g. Example 3.26 of [19], 
showing that the pairwise Markov property does not imply the global Markov property 
for DAGs when intersection is violated. 

I I I 



i j k i ^ — ^ j ^ — ^ k i — ^ j ^ — k 

(a) (b) (c) 

Figure 9. For the equivalence of pairwise and global Markov properties, (a) an undirected graph 
Gi that shows that the intersection property is necessary; (b) a bidirected graph G2 that shows 
that the composition property is necessary; (c) a directed acyclic graph G3 that shows that the 
intersection property is necessary. 

It is known that, for undirected graphs, the five graphoid axioms are necessary and 
sufficient for equivalence of pairwise and global Markov properties; see [19]. For bidirected 
graphs, the independence statement associated with a missing edge between nodes i and 
j is (i, 7 I 0) and only the five compositional semi-graphoid axioms are necessary for 
equivalence of pairwise and global Markov properties. This can be inferred from the 
proof of Theorem 3, since part I of the proof is not relevant for bidirected graphs unless 
C = and the intersection property is not used in part II of the proof. We conclude by 
stating this as its own proposition. 

Proposition 2. Let G = (V, E) be a bidirected graph. If an independence model J 
defined over V is a compositional semi-graphoid then J satisfies the pairwise Markov 
property w.r.t. G if and only if it satisfies the global Markov property w.r.t. G. 
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